AITopics | Manufacturer

Collaborating Authors

Manufacturer

Interpretable Image Classification with Adaptive Prototype-based Vision Transformers

Neural Information Processing SystemsMay-29-2025, 10:09:20 GMT

This method classifies an image by comparing it to a set of learned prototypes, providing explanations of the form "this looks like that." In our model, a prototype consists of parts, which can deform over irregular geometries to create a better comparison between images. Unlike existing models that rely on Convolutional Neural Network (CNN) backbones and spatially rigid prototypes, our model integrates Vision Transformer (ViT) backbones into prototype based models, while offering spatially deformed prototypes that not only accommodate geometric variations of objects but also provide coherent and clear prototypical feature representations with an adaptive number of prototypical parts. Our experiments show that our model can generally achieve higher performance than the existing prototype based models. Our comprehensive analyses ensure that the prototypes are consistent and the interpretations are faithful. Our code is available at https://github.com/Henrymachiyu/ProtoViT.

artificial intelligence, deep learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.92)

Genre: Research Report > Experimental Study (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Health & Medicine (1.00)
(3 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Dense Connector for MLLMs

Neural Information Processing SystemsMay-29-2025, 04:32:58 GMT

Do we fully leverage the potential of visual encoder in Multimodal Large Language Models (MLLMs)? The recent outstanding performance of MLLMs in multimodal understanding has garnered broad attention from both academia and industry. In the current MLLM rat race, the focus seems to be predominantly on the linguistic side.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Asia (0.28)

Genre: Research Report > Experimental Study (0.93)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Leisure & Entertainment (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(2 more...)

Add feedback

Score Distillation via Reparametrized DDIM

Neural Information Processing SystemsMay-28-2025, 23:05:04 GMT

While 2D diffusion models generate realistic, high-detail images, 3D shape generation methods like Score Distillation Sampling (SDS) built on these 2D diffusion models produce cartoon-like, over-smoothed shapes. To help explain this discrepancy, we show that the image guidance used in Score Distillation can be understood as the velocity field of a 2D denoising generative process, up to the choice of a noise term. In particular, after a change of variables, SDS resembles a high-variance version of Denoising Diffusion Implicit Models (DDIM) with a differently-sampled noise term: SDS introduces noise i.i.d.

diffusion model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > United Kingdom > England (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry:

Leisure & Entertainment (0.93)
Information Technology (0.68)
Automobiles & Trucks > Manufacturer (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

There's a Very Simple Pattern to Elon Musk's Broken Promises

WIREDMay-27-2025, 09:30:00 GMT

My predictions about achieving full self-driving have been optimistic in the past,

artificial intelligence, hyperloop, musk, (16 more...)

WIRED

Country: North America > United States > California (0.16)

Industry:

Automobiles & Trucks > Manufacturer (0.97)
Information Technology > Robotics & Automation (0.92)
Transportation > Passenger (0.84)
Transportation > Ground > Road (0.77)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.72)

Add feedback

e6d37cc5723e810b793c834bcb6647cf-Paper-Conference.pdf

Neural Information Processing SystemsMay-25-2025, 15:01:42 GMT

artificial intelligence, celeb basis, machine learning, (19 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre: Research Report > Promising Solution (0.66)

Industry:

Information Technology > Security & Privacy (0.68)
Automobiles & Trucks > Manufacturer (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Add feedback

Faith and Fate: Limits of Transformers on Compositionality

Neural Information Processing SystemsMay-25-2025, 14:07:47 GMT

Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on surprisingly trivial problems. This begs the question: Are these errors incidental, or do they signal more substantial limitations? In an attempt to demystify transformer LLMs, we investigate the limits of these models across three representative compositional tasks--multi-digit multiplication, logic grid puzzles, and a classic dynamic programming problem. These tasks require breaking problems down into sub-steps and synthesizing these steps into a precise answer. We formulate compositional tasks as computation graphs to systematically quantify the level of complexity, and break down reasoning steps into intermediate sub-procedures. Our empirical findings suggest that transformer LLMs solve compositional tasks by reducing multi-step compositional reasoning into linearized subgraph matching, without necessarily developing systematic problem-solving skills. To round off our empirical study, we provide theoretical arguments on abstract multi-step reasoning problems that highlight how autoregressive generations' performance can rapidly decay with increased task complexity.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Europe (0.67)
Asia > Middle East > UAE (0.14)
North America > United States > California (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks > Manufacturer (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

An NLP Benchmark Dataset for Assessing Corporate Climate Policy Engagement

Neural Information Processing SystemsMay-25-2025, 02:29:35 GMT

As societal awareness of climate change grows, corporate climate policy engagements are attracting attention. We propose a dataset to estimate corporate climate policy engagement from various PDF-formatted documents. Our dataset comes from LobbyMap (a platform operated by global think tank InfluenceMap) that provides engagement categories and stances on the documents. To convert the LobbyMap data into the structured dataset, we developed a pipeline using text extraction and OCR. Our contributions are: (i) Building an NLP dataset including 10K documents on corporate climate policy engagement.

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

Asia > India (1.00)
Asia > China (1.00)
Oceania > Australia (0.94)
(9 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation > Passenger (1.00)
Materials > Metals & Mining (1.00)
Law > Environmental Law (1.00)
(13 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

The Cybertruck was supposed to be apocalypse-proof. Can it even survive a trip to the grocery store?

The GuardianMay-14-2025, 11:00:06 GMT

The Cybertruck answers a question no one in the auto industry even thought to ask: what if there was a truck that a Chechen warlord couldn't possibly pass up – a bulletproof, bioweapons-resistant, road rage-inducing street tank that's illegal to drive in most of the world? Few had seen anything quite like the Cybertruck when it was unveiled in 2019. Wrapped in an "ultra-hard, 30X, cold-rolled stainless steel exoskeleton", the Cybertruck was touted as the ultimate doomsday chariot – a virtually indestructible, obtuse-angled, electrically powered behemoth that can repel handgun fire and outrun a Porsche while towing a Porsche, with enough juice leftover to power your house in the event of a blackout. At the launch, Tesla's CEO, Elon Musk, said the truck could tackle any terrain on Earth and possibly also on Mars – and all for the low, low base price of 40,000. "Sometimes you get these late-civilization vibes [that the] apocalypse could come along at any moment," Musk said.

artificial intelligence, cybertruck, tesla, (17 more...)

The Guardian

Country: North America > United States > California (0.15)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)
Automobiles & Trucks > Manufacturer (1.00)

Technology: Information Technology > Artificial Intelligence (0.49)

Add feedback

Trump Wants to Bring Back Factory Jobs. I Worked on the Assembly Line. It Was Hell.

SlateMay-14-2025, 09:40:00 GMT

Sign up for the Slatest to get the most insightful analysis, criticism, and advice out there, delivered to your inbox daily. I once witnessed a friend going through a severe midlife crisis. Basically overnight, this formerly serious and well-adjusted middle-aged man dumped his wife for a much younger girlfriend, got a face tattoo, and built a full-sized halfpipe in his house. Soon, we were barraged with music recommendations (all stuff he'd listened to in high school and college) and life updates laden with "hip" "slang" ("Despite the age gap, my situationship with Triniteigh is lowkey lit"). It was a transparent--and, from a certain perspective, even sympathetic--response to a universal anxiety: He'd seen that the good times were over, and that only decline lay ahead. But, like all nostalgists, he didn't realize that you can't ever truly go back; you can only go backward. The United States, under President Donald Trump, seems to be undergoing a similar midlife crisis, as this reactionary administration attempts to brute-force the country back to a golden age that many people are realizing either didn't exist in the first place or has been permanently lost to the mists of time and modernization.

artificial intelligence, cookie, factory, (14 more...)

Slate

Country: North America > United States (1.00)

Industry:

Automobiles & Trucks > Manufacturer (0.68)
Government > Regional Government > North America Government > United States Government (0.48)
Health & Medicine > Therapeutic Area (0.46)

Technology: Information Technology > Artificial Intelligence (0.88)

Add feedback

Can new patrol vehicles crack down on 'video game-styled' driving in California?

Los Angeles TimesMay-9-2025, 00:23:31 GMT

The California Highway Patrol is deploying new patrol vehicles in hopes of cracking down on what the agency called "video game-styled" driving. The vehicles, 100 Dodge Durangos, will be paired with a fleet of Dodge Chargers and Ford Explorers to "observe the most reckless and dangerous behaviors without immediate detection," according to a CHP news release. "The new vehicles give our officers an important advantage," CHP Commissioner Sean Duryee said in a statement. "They will allow us to identify and stop drivers who are putting others at risk, while still showing a professional and visible presence once enforcement action is needed." The vehicles will be placed in various regions across the state starting this week.

artificial intelligence, california, video game-styled, (1 more...)

Los Angeles Times

Country: North America > United States > California (0.78)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks > Manufacturer (1.00)

Technology: Information Technology > Artificial Intelligence > Games (0.64)

Add feedback